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analytical considerations how a very simple Hamiltonian can lead to surprisingly complicated and 
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I. INTRODUCTION 

During the last years, there has been noticed a sig- 
nificant interest in the field of complex networks and a 
lot of interdisciplinary initiatives have been taken aiming 
at investigations of these systems [H, 0, H, 0] . In paral- 
lel with empirical studies of real world networks [H, @], 
theoretical models 0, IH, Q and abstractive mathematical 
tools 0, 11 1 have been developed in order to under- 
stand complex mechanisms hidden behind the network 
functionality. 

Among many studied network models like random 
graphs [ia |. or growing networks there exists a class 
of models, called exponential random graphs, which has 
attracted an attention of the physics community just re- 
cently. The class was first studied in the 1980s by Holland 
and Leinhardt [HI], and later has been developed exten- 
sively by Strauss and others [FJ, |15|, The idea 
diffused from social statistics communities to physical so- 
ciety in recent years, when a number of physicists have 
made theoretical studies of specific models belonging to 
this family @, H [H 0, [H | . 

Exponential random graph model is defined to be not a 
single network but a set of possible networks (ensemble). 
The probability of a particular graph G in this ensemble 
is proportional to e~ H ^ G \ where 



(1) 



and H(G) is called graph Hamiltonian, m, is a collection 
of graph observables that reflect relevant constraints on 
studied graph properties and Qi is the set of fields conju- 
gate to fJij. 

A variety of graph Hamiltonians has been studied so 
far including simple random graphs Q , a network with 
reciprocity [3], the so-called two-star model [2lj], Strauss's 
model of a network with clustering , and others. The- 
oretical analysis of exponential random graph models has 
been developed by a number of authors. In most of cases 
linear models can be solved exactly in the limit of large 
system size. For nonlinear Hamiltonians mean-field the- 
ory and perturbation theory [ij [3, H3] have been applied 



in order to find phase transitions in the network struc- 
tures. 

In this paper we would like to show how a very sim- 
ple Hamiltonian can lead to surprisingly complicated and 
eventful phase diagram where wealth of structural phase 
transitions can not be forecast at first glance. Due to 
complexity of observed structures our methodology is 
mostly concentrated on Monte Carlo simulations. A sim- 
ple mathematical apparatus is also expounded in order 
to reveal details of the observed phenomena. The men- 
tioned calculations, although not so powerful, allow to 
understand when and why a particular transition occurs. 



II. MOTIVATION AND MODEL DESCRIPTION 

The model is defined on network that is composed of 
N nodes and L links, where each node acts as a single 
scientist or a single scientific group and a link between 
two nodes means that there exist scientific collaboration 
between them. 

The original purpose of the model was to describe op- 
timal structures of scientific collaboration in the Sixth 
Framework Programme for Research and Technological 
Development (FP6), which is the European Union ac- 
tion aiming to stimulate and support scientific activities 
conducted at national and international level. One of 
main purposes of the European Commission financing 
scientific projects in FP6 was to strengthen co-operation 
between project partners [23j |. In their proposal, appli- 
cants had to show that one of the aims of the planned 
project is intensification of co-operation between partic- 
ipants. They also had to argue that without such an 
interaction a goal of the project will be not achieved. 
The most popular observable which allows to measure 
effects of the co-operation in social network is clustering 
coefficient introduced by Watts and Strogatz in 1998 [24|. 
The clustering coefficient Cj of a single node i is the pro- 
portion of the number of links between the nodes within 
its neighborhood e divided by the number of links that 
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could possibly exist in the neighborhood 
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where ki represent degree of the considered node. If ki < 
2 then Cj = 0. The global clustering coefficient C is just 
an average of Cj over all nodes. 

The other obvious purpose of the funded project is to 
achieve the highest possible productivity. In our model 
productivity of each scientist i (or local scientific group) 
depends on the number of collaborators k t . The more 
collaborators work with a given scientist, the more pa- 
pers/ideas the scientist can produce. On the other hand, 
however, a large number of collaborators means the ne- 
cessity of parallel concentration on different scientific 
threads which leads to the decrease of productivity. In 
consequence, productivity pi of a single project partici- 
pant can be modelled by a logistic-like equation 



h 



(3) 



where h is an optimal (desired) number of collaborators 
(note that the highest productivity occurs for ki = h). 
Although one could expect that in reality h should be 
described by a kind of the Lotka distribution [25| , here we 
concentrate on the simplest case where h is the same for 
all elements of the system. Productivity P of the whole 
network is just an average over all N nodes normalized 
to unity 
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Hamiltonian of the described model can be written as 
follows: 



H(G) = -9P(G) - aC{G). 



(5) 



Monte Carlo procedure is defined by the following al- 
gorithm: we choose randomly two nodes and try remove 
(add) existing (non existing) link between them. If the 
change leads to the decrease of the initial system energy 
Eq, i.e. AE — E m / p — Eq < 0, where E m / p is the system 
energy after link removal/addition, we accept such a re- 
placement. Otherwise, when AE > 0, we accept it with 



the probability e 
algorithm. 



-AE 



i.e. we apply typical Metropolis 



III. ANALYTICAL CONSIDERATIONS 

In order to show a large variety of structural transitions 
observed in networks described by the Hamiltonian {5]), 
in our Monte Carlo simulations we have decided to fix 
one parameter a = 10 9 , and check behavior of the system 
as a function of the second parameter 9. At the moment 



please note, that since one may think of parameters a and 
9 as inverse temperatures our choice of the large value 
of a corresponds to low temperature limit in classical 
thermodynamics, and makes the considered system less 
susceptible to random effects. Further in the paper, the 
assumption of large a allows us to estimate the critical 
value of 9 from the simple condition AE = 0, which 
means that the stability of the initial stable structure is 
no longer preserved and a new network configuration can 
emerge. 

Fig. [T]shows possible scenarios of structural transitions 
in our model. Arrows represent directions of changes of 
the control parameter 9 given a = 10 9 . As one can see the 
number of possible transition paths is impressive. Later 
we show that the path chosen by the system depends 
mainly on the network size N. Moreover, beside simple 
paths like ABC A or ABEFGHA there may exist much 
more complicated paths like ABEFGHIDFGHA, i.e. 
we have to change parameter 9 from +oo to — oo and 
backward from — oo to +oo two or more times to return 
to the same structure we started! 

To find critical value of the parameter 9 at which a 
particular structural transition occurs one has to ana- 
lyze change in energy AE induced by link addition or 
removal taking into consideration currently existing net- 
work structure. As an example let us analyze transition 
A — B. The structure A corresponds to regular random 
graph, where node degree distribution is given by the 
delta function P{k) — 6(k — h). In this structure produc- 
tivity is maximal P = 1, whereas the number of trian- 
gles contributing to the clustering coefficient is negligible 
small C ~ (assuming that graph is sparse i.e. h -C N 
that is sociologically justified). The transition takes place 
when for a particular value of the parameter 9 energet- 
ically favorable is to add a link which creates the first 
triangle (i.e. a decrease of productivity is sufficiently re- 
warded with an increase of clustering). The described 
situation is schematically presented in Fig. [5^. Energies 
corresponding to both structures depicted in the figure 
are respectively given by 



E = -t 
E p = —(, 
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First, inserting the values of energy into the condition 
AE = E p — E = 0, and next expanding exponential 
functions in Taylor series up to the second order one gets 
the critical value of the control parameter for the consid- 
ered transition A — B 
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As one can see the transition does not depend on the 
system size N . 
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FIG. 1: (Color online) Diagram representing possible transitions between network configurations as a function of parameter 
9. Gray arrows means that the configuration D is composed of two other configurations (C and E), and the two parts of the 
system follow different paths. The figure shows networks with different sizes N just to emphasize particular character of a 
given configuration. 



Much more complicated system behavior is observed 
when the control parameter 9 crosses zero and becomes 
negative (see Fig. [T]). Productivity contribution to en- 
ergy changes sign and all nodes having at the moment 
degree k — h turn out to be in unstable configuration (see 
schematic explanation given in Fig. [3]). For such nodes 
when decrease in clustering is sufficiently rewarded by 
decrease in undesirable productivity the stable configu- 
ration B will be destroyed: some nodes will decrease their 
degrees whereas others will increase them (cf. Fig. 
As one can see in Fig. [T] the considered network may 
follow one of three paths resulting in one of three config- 
urations C, D, or E. 

Unfortunately, due to probabilistic character of the 
Monte Carlo procedure it is hard to calculate analytically 
which direction of changes will be taken by the system of 
a given size N. To check what is really happened during 
the transition we perform numerical simulations, results 
of which are summarized in Fig. [4] The figure shows a 
fraction of isolated nodes as a function of N . As one can 
see for small system sizes degrees of all nodes drop to zero 
and the system transforms into the empty graph (config- 



uration E in Fig. [T]). Above some critical value of A a 
part of nodes condensate together and a fully connected 
subgraph accompanied by isolated nodes appears (con- 
figuration D in Fig. [TJ. Finally in the thermodynam- 
ical limit all nodes condensate and the complete graph 
emerges (configuration C in Fig. [T]). 

Here, we have to stress that configurations C and E 
are rather purely theoretical and can not appear in real 
systems. In the case E, it is connected with the fact 
that in the configuration B a small number of nodes with 
degree larger than h can exist what helps to create nodes 
with maximal degree during the transition. Nevertheless, 
since the configuration D is in fact composed of the two 
remaining configurations C and E, it is still instructive 
to analyze them. 

Now, let us analyze the transition C — A, i.e. transition 
from the complete graph to the regular random graph. 
Because one can not add a link to the complete graph 
the only situation to analyze is removal of a link. The 
described situation is schematically presented in Fig. [2J:. 
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FIG. 2: (Color online) Schematic situations occurring during 
particular transitions used for considerations performed in the 
text, a) transition A — B; b) transition B — D; c) transition 
C — A; d) transition E — F; e) transition H — A. 



A simple calculation gives 



E Q = -9N(N-l)e-^^- 
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which leads to 
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(8) 



(9) 



in the limit of large N. As one can see this transition de- 
pends on the system size N, and in the thermodynamical 
limit N — > oo the critical value Oq-a tends to infinity. 

Now, let us discuss the behavior of the system if the ini- 
tial configuration is E (i.e. the empty network). The first 




FIG. 3: (Color online) Productivity contribution to energy 
for 6 > (upper figure) and for 9 < (lower figure) as a 
function of node degree k. The later situation shows that to 
decrease energy some nodes will reduce their degree (destroy- 
ing existing clusters) and others will increase it (connecting 
two clusters together). 
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FIG. 4: Number of nodes with degree k = as a function 
of system size N. Three thin curves represent a transition 
B — D, while a thick one represents a transition I — D. The 
thick curve shows that for the same value of the parameter h 
the number of isolated nodes is lower after transition I — D 
than after transition B — D. 
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transition E—F occurs when the parameter 9 equals zero. 
Since the productivity does not influence energy of the 
system at this point, links can appear randomly (they do 
not increase energy so they are acceptable in the Monte 
Carlo procedure). First triangles appear in the network 
and the clustering coefficient increases. Such a dynam- 
ics leads to the configuration in which fully connected 
subgraph is surrounded by a number of peripheral nodes 
with degree k — 2 (configuration F in Fig. []}. Because of 
complicated situations in intermediate time steps a rig- 
orous analytical explanation of the transition E — F is 
beyond our abilities. Nevertheless, below we analyze a 
simplified situation, which allows one to understand why 
the transition occurs. 

Thus, let us consider a fully connected subgraph with 
an additional node Q having b links (see Fig. [2ji). Just 
like before, one can analyze what happens with the clus- 
tering coefficient if we add (remove) one link. Fig. [5] 
shows the solution of this problem when size of the fully 
connected subgraph is Np = 20. For a given b clustering 
coefficient of the considered structure is marked by the 
thick line. Thin lines show a new clustering coefficient 
after addition (circles) or removal (triangles) of one link. 
At the beginning, let us assume that we have b < 10. 
To increase clustering coefficient we have to remove one 
link which leads to the configuration with b — 1 links at- 
tached to the peripheral node Q. Further, it is easy to see 
that the process will follow towards removing next links 
belonging to Q until the node will have only two links. 
The node degree can not drop below Uq = 2, because 
for Uq < 2 local clustering coefficient cq suddenly drops 
from 1 to which drastically decreases global clustering 
coefficient of the whole structure. On the other hand, if 
we assume that b > 10, then energetically favorable is to 
add another b+1 link to the node Q. Again, one can see 
that the node Q will try to connect to all other nodes, 
i.e. the node Q becomes a member of the fully connected 
subgraph. 

Starting from the configuration F if one further in- 
creases 9 the productivity P starts to matter. It means 
that above some critical value of this parameter an addi- 
tion of the third link to one out of peripheral nodes with 
degree k = 2 can compensate energy lost coming from de- 
crease of the clustering coefficient. The same reasoning 
explains successive network reorganizations when adding 
next links (up to k = h) to peripheral nodes is energeti- 
cally favorable. 

At the moment, let us note that nodes belonging to 
the fully connected subgraph in the configuration F have 
different degrees. Their degrees are composed of Np — 1 
mutual links and links coming from peripheral nodes with 
degree k = 2 (the peripheral links are randomly dis- 
tributed among the nodes creating the fully connected 
subgraph) . It means that in a finite system there always 
exists a node with the largest degree. It is easy to check 
that addition of a new link to this node makes cluster- 
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FIG. 5: Clustering coefficient C for the system depicted in 
Fig. [2Jl. Arrows show evolution of the system (addition or 
removal of successive links). 
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FIG. 6: Schematic node degree distribution before and af- 
ter transition F — G. The node with the highest degree is 
exaggerated to show its dynamics during the transition. 



ing coefficient of the whole structure worse in the least 
way. It means that above a certain value of 9 new links 
are added to this node making its degree rapidly grow- 
ing. Fig. [6] shows in a schematic way the node degree 
distribution below and at the critical value of 9 for the 
transition. 

As we have described above, if one further increases 
the control parameter 9 the peripheral nodes suddenly 
increase their degrees from k — 3 to k = 4. It gener- 
ates a similar mechanism as described in the previous 
paragraph, i.e. consecutive node with the largest degree 
(expect the one which has already increased its degree 
to N) stepwise increases its connectivity. The network 
configuration arising along the transition corresponds to 
the configuration G presented in Fig. [T] 

Another crucial point in the system evolution is the 
transition G — H, where the fully connected subgraph is 
destroyed in the similar manner as the complete graph 
in the transition C — A. After the transition G — H our 
network consists of several hubs and a large number of 
loosely connected peripheral nodes. 

The configuration H is presented twice in Fig. [1] in 
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FIG. 7: Change of productivity in the star-like structure de- 
picted in Fig. as a function of degree of the central highly 
connected node w, for h = 10. 



order to show two possibilities of the system evolution: 
it can be stable when 9 — > oo, or hubs can be destroyed, 
and the transition H — A takes place. To analyze stability 
of the configuration H is enough to consider a simplified 
structure presented in Fig. [2]3 (note that the simplified 
star-like structure neglects effects of clustering which may 
occur in the original configuration H). In such a struc- 
ture, change of productivity resulting from addition or 
removal of a single link between the hub with a large 
degree w and one of peripheral nodes with degree h (as 
we have already stated the maximal connectivity of the 
peripheral nodes along the transition path F — G — H is 
k = h) is presented in Fig. [7] 
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A given process (link addition or removal) occurs only if 
the change of productivity is positive. It means that if w 
is small the only possibility is to add a new links. When 
h < w < w c (c.f. Fig. [7]) links can only be removed. It 
means that degree of the hub should decrease from w to h, 
and the transition H — A takes place. On the other hand, 
when w > w c , both processes (addition and removal of 
links) are no longer possible. It means that the system 
remains in the stable configuration H. Unfortunately, 
because the analysis neglects clustering it does not allow 
us to calculate the precise critical value of Oh- a- 

Finally, let us analyze the transition H — I (Fig. [1]), 
which occurs when starting from the stable configura- 
tion H one decreases the control parameter 0. On the 
basis of our previous considerations one can predict that 
the configuration I emerges when the input of clustering 
coefficient to the system energy crosses a critical value. 
In a similar way like during the transition A — B nodes 



with low degrees have tendency to form triangles. On 
the other hand, however, because hubs are stable (we 
have shown it in the previous paragraph) instead of many 
separated clusters like in the configuration B the system 
evolves towards the stable configuration I. 

If the parameter 9 is sufficiently negative the configura- 
tion I is destroyed just like the configuration B, and the 
mechanism of this transition is the same: which configu- 
ration (C,D, or E) appears depends on the system size N 
(c.f. thick dotted curve in Fig. HJ). The only difference is 
that for a given N the number of isolated nodes is lower 
in comparison with the transition B — D. 

As we have stated before, in fact, our system in the 
configuration D consists of two configurations C and E. 
It means that at least to some value of the parameter 9 
the part of the system that is equivalent to configuration 
E follows a path D — F — G — H, and the second part, 
equivalent to the configuration C, follows the path C — A. 
At some point, up to now separated parts of the network 
combine together. In figure[TJ in order to make the whole 
picture as clear as possible, we marked paths accessible 
for these two components of the configuration D by gray 
arrows. 

Let us also notice that a path our system follows can 
be really complicated. For example, let us consider a 
network in the configuration A. After transitions A — 
B — D the number of isolated nodes is high, which allows 
to create hubs with very high degrees w > w c as a result 
of the transition F — G. It means that the configuration 
H is stable, and the system follows the path H — I — D . 
Now, the number of isolated nodes is much lower (see Fig. 
HJ. Their number is often too small to once more time 
create hubs with degrees w > w c during the transition 
F—G. Thus, after a series of transitions D—F—G—H the 
system returns to the initial state A. It means, that the 
return to the same state is possible after a complicated 
path A-B-D-F-G-H-I-D-F-G-H-A, 
in which the chain of transitions D — F — G — H — I may 
be repeated several times. 

Finally, we have to stress that a part of phase transi- 
tions we observed are visible only in finite size systems, 
i.e. if the system is large enough a particular phase tran- 
sition can change its character and can lead to develop- 
ment of different structure. Although usually physicists 
use term phase transition in the context of systems in 
thermodynamical limit, our delinquency can be justified 
because the model we study has been proposed to social 
systems where the number of elements is always limited. 



IV. DISCUSSION 

As we have presented in the previous section our sim- 
ple model is characterized by a surprisingly complicated 
and eventful phase diagram with plenty of metastable 
states. Nevertheless, since the model was sociologically 
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FIG. 8: (Color online) Schematic view on network of EU 
projects. Two highly connected coordination actions are sur- 
rounded by a plenty of small research projects. 



motivated, let us discuss the observed network configu- 
rations in the context of scientific collaboration. 

First, the configuration B seems to be the easiest to 
interpret. If each node represents scientific group then 
we see here separated projects consisting of several sci- 
entific groups where each group collaborate with each 
other. The realism of the situation can be questionable 
since real projects can be composed of different number 
of participants, but let us remind that in the model we 
have assumed that the optimal number of collaborators 
h is fixed for all groups. To make the model more real- 
istic we should draw the parameter from the Lotka-like 
distribution [25[, but it would certainly complicate ob- 
tained results and at the moment we were much more 
interested in the description of the observed structural 
phase transitions. 

The second configuration which seems to reflect real- 
world observations is the configuration /. Fig. [5J shows 
the real (although simplified) case of EU projects be- 
ing currently in progress. Let us explain that complex 
systems research in Europe is funded through two Eu- 
ropean Commission actions: NEST (New and Emerg- 
ing Science and Technology, and FET-IST (Future and 
Emerging Technology Information Society Technology) 
23]. It is mainly being done through small projects called 
STREPS (Strategic Targeted Research Projects). Apart 
from them the European Commission is currently fund- 
ing two Coordination Actions to support complex sys- 
tems science: ONCE-CS (funded by IST-FET) and GI- 
ACS (funded by NEST). In Fig. 1 they are presented 
as main nodes which serve as knowledge transfer units 
between particular projects. Part of projects funded 
mainly by NEST collaborate with GIACS, and projects 
funded mainly by IST-FET are supported by ONCE-CS. 



Of course there are projects which take support from 
both coordination actions because aims of both CA's are 
slightly different. Let us stress that the above picture is 
very simplified. The main simplification is that CA's are 
represented by single nodes in Fig. [5J which is evidently 
not true - coordination actions are projects consisting of 
many participants as well as STREPs. 

Finally, the last comment. A careful reader can ask 
what could be the interpretation of negative value of the 
control parameter 9. In fact, it corresponds to the situa- 
tion where groups composed of small number of partici- 
pants are undesirable. The sociological explanation (al- 
though not connected with scientific collaboration) can 
be expressed as follows: or you commune with one global 
social group or you will be separated, which can be rec- 
ognized as the fascist ideology. 



V. CONCLUSIONS 

In this paper we have presented a model of social col- 
laboration. Although the model is expressed by a sim- 
ple Hamiltonian the richness of observed structural phase 
transitions is impressive. Most of them we can only an- 
alyze qualitatively and further studies have to be per- 
formed to clarify reasons for which a given structure ap- 
pears. We uncover many aspects of the studied model 
but in fact much more questions arise during our inves- 
tigations. 

Although simplifications of the model do not allow to 
render in detail the real-world space of scientific projects, 
we have shown that some configurations formed in the 
system remind existing structures of European projects. 
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